REDUCTION OF NETWORK RETRIEVAL LATENCY USING CACHE AND 
DIGEST 



FIELD OF THE INVENTION 
The present invention relates to a system and method for reducing network retrieval 
latency. In particular, this invention relates to a system and method for reducing network 
retrieval latency using cache and digest. 

BACKGROUND OF THE INVENTION 

Users of the Internet and other network information retrieval systems sometimes 
suffer from excessive latency (or response time). That is, the time between making a request 
for information and the time that the requested information is finally delivered is often longer 
than desired. For a user of a Web browser, for example, such latency may require the user to 
wait for seconds, or even minutes, while the requested information (e.g., text, images, audio, 
or applications) is loaded. A typical Web page retrievable from the Internet contains several 
embedded images. Thus, the latency for presenting an entire page may be the sum of the 
latencies for numerous retrievals (e.g., the HTML file and several GIF or JPEG images). 

The excessive latency problem is especially significant for slower network 
connections (e.g., dial-up via modem or wide-area wireless connections) and for heavily 
congested networks because congestion tends to increase transmission time and therefore 
reduce effective bandwidth. 

There are several prior approaches to achieve latency reduction on the Internet and 
other networked systems. In one approach, data objects are cached at an ultimate client or at 
an intermediate (proxy) cache for subsequent re-use. Caching is not useful, however, if a 
requested data object is not already present in the cache, which happens more often than not. 

In another approach, compression algorithms are used to compress the representation 
of data objects before transmission. Such compressed data objects are then decompressed 
without loss upon reception. In general, most image and audio formats on the Internet are 
already compressed. 



In yet another approach, distillation algorithms are used to remove certain contents 
from requested data objects, so as to make the requested data objects transmissible in fewer 
bytes (e.g., converting a 400x400 pixel image to a 200x200 pixel image). Unlike 
compression, however, distillation is irreversible and may degrade the resulting data object or 
render it useless. 

In yet another approach, prediction, by various means, is performed to automatically 
request in advance data objects that may be requested in the future. Unless the prediction 
algorithm is perfect, however, this approach is liable to result in false pre-fetches and hence 
wasted bandwidth, which can itself increase latency. 

In another approach (a so-called delta encoding mechanism), when a modified data 
object is very similar to a previous instance already held in a cache, the sender can transmit 
the differences between the two instances, rather than the entire new instance. This approach 
saves significant amounts of transmission time in some cases, but not in the majority of cases. 

Frequently, data objects on the Internet or other networked systems appear in multiple 
exact copies, but with different names. For example, the same Compaq logo image might 
appear under different URLs at different servers. Because the URL for each such data object 
is different, traditional caches do not recognize that a request for a data object at one URL is 
satisfiable by the same data object at another URL. 

Thus, it is desirable to provide a system and method for reducing retrieval latency that 
overcome the problems associated with prior approaches. 

SUMMARY OF THE INVENTION 

A system and method for comparing a digest value in an incoming response to a 
digest index (e.g., a hash table). The digest index includes digest values of data objects 
already cached at a cache. Data object retrieval is avoided when a match is found between 
the digest value and a digest value in the digest index. 

Digest values may be calculated using a cryptographic digest algorithm known in the 
art, such as MD5, SHA-1, Rabin Fingerprint, or any digest algorithm that is unlike!; tc 
generate the same digest value for two different data object. Further, in each incoming 
response message, a digest value for a responding data object is positioned before the data 
object itself. For example, the digest value is located in the header portion of an incoming 
response message. 



In one embodiment, this system requires the collaboration between a sender and a 
receiver and an appropriate process executed at the client cache. For example, the client 
cache receives a request (from a client) for the data object at the URL: 

"http://example.eom/x.gif." If the client cache determines that it does not have a cache entry 
for the URL "http://example.eom/x.gif," it forwards the request to an HTTP server at 
example.com. The HTTP server processes the request and returns a response with a header 
portion having multiple header fields as follows: 
HTTP/1.1 200 OK 

Digest: md5=HUXZLQLMuI/KZ5KDcJPcOA= 
Date: Sun, 16 May 1999 02:00:04 GMT 
Content-type: image/gif 

All of the above header names have been previously defined in standards or public 
proposals for other purposes known in the art. The "Digest" header in this example indicates 
that the MD5 digest of the response data is equal to "HUXZLQLMuI/KZ5KDcJPcOA==." 
After receiving the header portion, the client cache parses the header fields in the header 
portion, extracts the digest, and searches a digest index to determine if any currently-cached 
data object has a matching MD5 digest. In an exemplary embodiment, a hash table is used as 
a digest index to facilitate fast look ups of digest value in the response header. If no such 
cached data object is found, the client cache continues its normal behavior and retrieves the 
rest of the current response. If such a data object is found, however, then the client cache 
sends the data object with the header portion to the client and avoids retrieving the rest of the 
current response (e.g., by aborting the connection with the server). 

Thus, a significant retrieval latency reduction is achieved when the response data is 
large and/or when the incoming network connection is slow (i.e., the time between receiving 
the first response header and the final byte of the response data is relatively lengthy). 

BRIEF DESCRIPTION OF THE DRAWINGS 
FIGURE 1 illustrates an exemplary distributed system in accordance with zn 

embodiment of the invention. 

FIGURE 2 illustrates an exemplary server in accordance with an embodiment of the 

invention. 
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FIGURE 3 illustrates an exemplary client in accordance with an embodiment of the 
invention. 

FIGURE 4 illustrates an exemplary proxy cache in accordance with an embodiment of 
the invention. 

FIGURE 5 is a flow chart illustrating an exemplary server process in accordance with 
an embodiment of the invention. 

FIGURE 6 is a flow chart illustrating an exemplary cache process in accordance with 
an embodiment of the invention. 

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 
Figure 1 illustrates an exemplary embodiment of a distributed system 100. The 
distributed system 100 includes multiple servers 102-106 and multiple clients 108-1 12 
connected to a network 101 through network interconnections 116. In an exemplary 
embodiment, one or more clients, such as the client 108, may be connected to the network 
101 via a proxy cache 1 14. The network 101 may be a global communication network, such 
as the Internet, or a private network, sometimes called an Intranet. Examples of network 
interconnections 1 16 include switches, routers, etc. For ease of explanation, only a 
representative number of servers, clients, and proxy cache are depicted in Figure 1. A person 
skilled in the art would recognize that on a network, such as the Internet, many more servers, 
clients, and proxy caches can be interconnected. 

Figure 2 illustrates an exemplary server 102 in accordance with an embodiment of the 
invention. The server 102 includes a communications interface 202, a CPU 204, and memory 
208. The memory 208 includes an operating system 210, server applications 212 for 
facilitating server functions, data object files 214 for storing various data objects available to 
clients connected to the network 101, a digest generator 216 for generating digest values for 
each data object in the data object files 214, a digest index 216 for storing a list of previously 
generated digest values, and a communications transport and protocol for facilitating 
communications with the network 101. In an exemplary embodiment, the digest Index 216 
comprises a hash table, a tree structure, or other suitable data structure. In one embodiment 
(not shown), the server 102 does not include a digest index, and therefore calculates each 
digest value as needed. 
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Figure 3 illustrates an exemplary client 1 10 in accordance with an embodiment of the 
invention. The client 1 10 includes a communications interface 302, a microprocessor 304, a 
user interface 306, and memory 308. In an exemplary embodiment, the memory 308 includes 
an operating system 3 10, a browser application 3 12 for facilitating browsing activities on the 
network 101, a cache 314, and a communications transport and protocol 324 for facilitating 
communications with the network 101 . In another exemplary embodiment, the cache 3 14 is 
located in a separate computer system (not shown) and is connected to the client 1 10 via a 
network (not shown). The cache 3 14 includes a digest generator 3 1 6 for generating digest 
values for data objects cached at the cache 3 14, a digest index 3 1 8 (e.g., a hash table or a tree 
structure, etc.) for storing previously generated digest values, a header parser 320 for parsing 
headers of incoming responses from a server 102-106 or a proxy cache 1 14, and data object 
files 322 for caching data objects. 

Figure 4 illustrates an exemplary proxy cache 1 14 in accordance with an embodiment 
of the invention. The proxy cache 1 14 includes a communications interface 402 for 
communicating with the network 101, a CPU 404, and memory 408. The memory 408 
includes an operating system 410, proxy applications 412 for facilitating proxy functions, a 
client list 414 listing the clients serviced by the proxy cache 1 14, a cache 416, and a 
communications transport and protocol 426 for facilitating communications with the network 
101. The cache 416 includes a digest generator 418 for generating digest values for data 
objects cached at the cache 416, a digest index 420 (e.g., a hash table or a tree structure) for 
storing previously generated digest values, a header parser 422 for parsing headers of 
incoming responses from a server 102-106, and data object files 424 for caching data objects. 

Figure 5 illustrates an exemplary server process in accordance with an embodiment of 
the invention. At step 502, a server 102 receives a request for a data object. In one 
embodiment, the request is sent from a client 108 via the proxy cache 114. In another 
embodiment, the request is sent from a client 110 directly to the server 102. Upon receipt of 
the request, the server 102 either retrieves a previously calculated digest value associated with 
the requested data object from the digest index 218 or calculates a digest value ^.g 5 rshcc :he 
digest value is not in the digest index 218) using the digest generator 216 (step 504). Next, 
the server 102 inserts the retrieved or calculated digest value into a header portion of a 
response to the request (506). In a preferred embodiment, the digest value is inserted as early 
as possible in the header portion. Typically, a response includes a header portion and a data 



portion. Further, the header portion generally includes multiple header fields. The header 
portion is sent to the client either via a proxy cache 1 14 or directly (step 508). The data 
portion of the response is sent next unless and until the server 102 receives an instruction to 
abort transmission either from the client or the proxy cache 114 (step 510). 

Figure 6 illustrates an exemplary process performed by the client cache 314 or the 
proxy cache 114. At step 602, a request for a data object is received from the client. In the 
following discussion, the "client" is a client device or computer when the process is 
performed by a proxy cache 1 14. When the process is performed by a client cache 314, the 
"client" is a client program, such as the browser application 312 (see Figure 3). Next, the 
cache (3 14 or 1 14) checks the data object files (322 or 424) to determine whether the 
requested data object is already cached and not expired (step 604). If the requested data 
object is already cached (step 606), that data object is sent to the client as the response to the 
request (step 608). If the requested data object is not cached (step 606), a request for a data 
object is sent to a server 102 (step 610). Next, a header portion of a server response is 
received from the server 102 (step 612). The header portion of the response is parsed for a 
digest value of the requested data object (step 614). The digest value obtained is compared to 
the digest index (318 or 420) for a matching digest value (step 616). If a match is found (step 
618), the server 102 is informed to abort the transaction so that the data portion of the 
response is not sent (step 620). Next, a cached data object corresponding to the matching 
digest value is retrieved from the data object files (322 or 424) (step 622). The retrieved data 
object is sent to the client (step 624). In an exemplary embodiment, steps 620 and 622 can 
be performed simultaneously or in a reverse sequence. Retrieval latency for retrieving the 
requested data object is reduced by supplying the client with an identical data object that was 
already cached in a cache at the client cache 3 14 or the proxy cache 1 14. 

In a request-per-connection model, such as originally used in HTTP, the cache (314 or 
1 14) can simply close the transport connection (e.g., the TCP connection) to stop data 
transmission. Recent improvements to HTTP include a "persistent connection" model, which 
allows several requests to be sent over the same connection to generally improve 
performance. In the persistent connection model, when a connection is closed, all other 
pending requests made during the connection are also terminated and may need to be resent 
later. In some cases, it may be beneficial overall to close the connection to stop transmission 
even if other pending requests will have to be resent (e.g., when the data object being 



transmitted is large). In a preferred embodiment, the server is requested to only stop sending 
a specific response without closing the connection and without preventing the transmission of 
responses to other requests. This feature could be implemented in a future version of HTTP, 
especially in proposed future versions that include a multiplexing mechanism. Mechanisms 
for prematurely terminating a data exchange without closing the transport connection are 
well-known in art. 

Referring back to step 61 8, if there is no matching digest value in the digest index 
(3 18 or 420), the data portion of the response is received (step 626) and the received data 
portion (or data object) is sent to the client (step 628). Also, the received data object is 
typically added to the cache (314 or 1 14) and its digest value is added to the digest index (318 
or 420) (step 630). 

In an exemplary embodiment, the server 102 is configured to add digest values to the 
header portion of a response and to send the header portion of a response first before sending 
the data portion of the response. In another exemplary embodiment, some servers 102 are not 
configured to send digest values in the header portion, but either send the digest value out of 
order or do not send it at all. In this embodiment, upon receiving a server response that 
includes a requested data object but does not include a digest value in the header portion, the 
proxy cache 1 14 calculates the digest value of the requested data object and sends the digest 
value in a header portion of a proxy response to the client cache 314. When the client cache 
314 receives the header portion from the proxy cache 1 14, it parses the header portion for any 
digest value. If a digest value is found in the header portion, the client cache 3 14 then 
compares that digest value to the digest index 318 for any matching digest values. If a match 
is found, then a previously cached data object is sent to the client 1 10. This way, a reduction 
of the potential retrieval latency is achieved because the client does not have to wait for the 
data portion of the response to be delivered from the proxy cache 1 14. 

Digest value calculations are not prohibitively expensive. For example, a 90 MHz 
Pentium can compute MD5 digests at 191 Mbits/sec, which is a much faster speed than most 
current network links. Typically, home users (clients) on dialup, ISDN, or DSL lines 
comprise the slowest paths in the network 110. When the digest value is provided by the 
proxy cache 1 14 to the client cache 314 connected via a relatively slow link, the proxy cache 
1 14 provides the client cache 314 an advance notification that the requested data object may 
be already cached. Thus, the client cache 314 is capable of using this information to provide 



the requested data object before the response to the request is completely delivered and 
possibly abort the delivery. 

Cache management is achieved by mechanisms known in the art, such as the least- 
recently-used mechanism. A summary regarding available cache management mechanisms 
5 can be found in the article entitled "Squeezing More Bits Out of HTTP Caches," Jeffrey C. 
Mogul, May/June 2000 I.E.E.E. Network 14(3):6-14. This article is hereby incorporated by 
reference for all purposes. 

The foregoing examples illustrate certain exemplary embodiments of the invention 
from which other embodiments, variations and modifications will be apparent to those skilled 
1 0 in the art. The invention should therefore not be limited to the particular exemplary 
embodiments discussed herein, but rather defined by the claims appended hereto. 
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